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Abstract — Multiple antenna (MIMO) devices are widely used 
to increase reliability and information bit rate. Optimal error 
rate performance (full diversity and large coding gain), for 
unknown channel state information at the transmitter and for 
maximal rate, can be achieved by approximately universal space- 
time codes, but comes at a price of large detection complexity, 
infeasible for most practical systems. We propose a new coded 
modulation paradigm: error-correction outer code with space- 
only but time-varying precoder (as inner code). We refer to 
the latter as Ergodic Mutual Information (EMI) code. The EMI 
code achieves the maximal multiplexing gain and full diversity is 
proved in terms of the outage probability. Contrary to most of 
the literature, our work is not based on the elegant but difficult 
classical algebraic MIMO theory. Instead, the relation between 
MIMO and parallel channels is exploited. The theoretical proof of 
full diversity is corroborated by means of numerical simulations 
for many MIMO scenarios, in terms of outage probability and 
word error rate of LDPC coded systems. The full-diversity and 
full-rate at low detection complexity comes at a price of a small 
coding gain loss for outer coding rates close to one, but this loss 
vanishes with decreasing coding rate. 

Index Terms — Quasi-static multiple input multiple output 
channels, outage probability, full-rate and full-diversity space- 
time coding, signal space diversity, error-correcting codes. 



I. Introduction 

Multiple antennas (MIMO) has become an important means 
to combat channel fading as well as to increase channel 
capacity. Since the pioneering work ||25l , Il42l . a great amount 
of research effort has been focused on characterizing the 
fundamental limits of MIMO communications that are now 
well understood in many aspects. In particular, when the 
channel is subject to slow fading, a fundamental tradeoff as 
to how to optimally exploit the multiple antennas has been 
characterized in ||52l . 

Code design problems for multi-antenna channels without 
channel state information at the transmitter have first been 
addressed by Tarokh et al. in |j40l , in which the notion of 
space-time codes was introduced. By spanning the message 
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over both spatial and temporal dimensions, full spatial diver- 
sity can be achieved. The elegant construction of orthogonal 
space-time codes enables a simple decoding at the receiver side 
121, 1401 . Unfortunately, the benefit of simplicity is obtained 
at the cost of rate; more specifically, at most one independent 
symbol per channel use can be transmitted. Furthermore, 
the rate of orthogonal space-time codes decreases with the 
number of transmit antennas, while the ergodic capacity of 
a MIMO channel linearly increases with the minimum of 
the number of receive and transmit antennas. While the 
highly structured codes have a poor spectral efficiency, the 
random coding argument |fl8l , ||52l suggests that there exists 
a space-time code with finite length that can achieve the 
fundamental diversity-multiplexing tradeoff (DMT). Guided 
by the diversity-multiplexing tradeoff, approximately universal 
pm . i.e., DMT optimal for all channel statistics, structured 
space-time codes designs have been proposed Pjj, |fl9l , |20l , 
Ipffl . The key to obtain approximately universal space-time 
codes with a finite code length is the non-vanishing properties 
of the product of the min{n.T, smallest singular values 
of the codeword matrices for increasing signal-to-noise ratio, 
where t%t and ur are the number of transmit and receive 
antennas, respectively. In particular, this criterion coincides 
with the non-vanishing determinant criterion when tit = tlr 
HH, (2Ql- Interestingly enough, when the MIMO channel is 
diagonal, i.e., a parallel channel, the criterion is reduced to 
the product-distance criterion, well known for code design for 
single-antenna block fading channels (3), (6), PHI . 

In general, the MIMO channel can be decomposed by the 
singular value decomposition into a diagonal channel preceded 
by a random unitary matrix rotating the channel input. Despite 
the diagonal channel being present in the singular value de- 
composition, there is a fundamental difference between MIMO 
and parallel channels when the channel state is unknown 
at the transmitter side. In fact, by carefully examining the 
criterion for approximate universality lEOl . [HP . constraints 
on the codeword structure are imposed in order to guarantee 
a "good" performance for the worst-case rotation incurred by 
the channel matrix. Such a phenomenon does not exist for 
diagonal/parallel channels. To guarantee good performance in 
any MIMO channel, precoding over at least rSj- dimensions 
is needed for MIMO, while we only need to code over 
tit dimensions for parallel channels. A direct consequence 
of the increased precoding dimensionality is the decoding 
complexity. While there exist efficient decoding algorithms 
for such problems (see ||28l . Il34l and references therein), the 
complexity still grows in polynomial time with the dimension 
for the best-case scenarios and in exponential time for the 
worst cases. 

The advantages of approximately universal space-time code 
designs are quite clear: high spectral efficiency, high diversity 



2 



gains even without channel codes, and short block length. 
On the other hand, error-correction codes with relatively 
large block lengths are used in most modern communica- 
tion systems. As a result, a common paradigm of coding 
for multi-antenna systems is the concatenation of an error 
correction code (also referred to as outer code) and a space- 
time code (also referred to as space-time precoder, or inner 
code). That is, a long coded block is split into many sub-blocks 
that are space-time precoded individually. While the precoder 
is designed without the consideration of an outer channel code, 
the actual error performance depends on multiple space-time 
blocks. Although using a well-designed space-time precoder 
can only improve the overall error rate performance, it comes 
at the price of high decoding complexity, as mentioned above. 
In fact, it turned out in some cases that the use of space- 
time codes in conjunction with channel codes does not bring 
extra gain but unnecessary decoding complexity (e.g. when 
the coding rate of the channel code is sufficiently low to 
recover a portion of the spatial diversity ll26l ). Therefore, the 
conventional paradigm, to design space-time codes without 
considering the presence of an outer code, may not be efficient 
when it comes to the trade-off between performance and 
complexity. 

Inspired by the connection between MIMO and parallel 
channels (the MIMO channel is a parallel channel with random 
precoder) as well as the observation that the outer codeword 
spans over a large amount of channel uses, we propose a 
new coded modulation paradigm: error correction codes with 
space-only but time-varying precoder. The rationale behind 
this is to avoid the worst-case rotations (denoted as corrupt 
precoders in this paper) that are the main factor of deterioration 
in uncoded MIMO communication. The time-varying aspect 
of the space-only precoder reduces the effect of the worst- 
case rotations to a fraction of the transmission time, which 
can then be compensated by the outer code. Note that space- 
only precoders have been considered in the literature when 
channel state information was available at the transmitter (e.g. 
through feedback), see OTl and references therein. 

The main contribution of this paper is as follows. We first 
observe that the MIMO channel corresponds to a parallel 
channel with a precoder that is random, but then remains 
fixed once it is chosen. We then extend the framework of 
signal space diversity for parallel channels to explain why full 
transmit diversity is not achieved with fixed space-only codes 
for MIMO. This new framework allows us to prove Theorem 
[U claiming that time-varying space-only precoders achieve 
full diversity on any MIMO channel, thereby proposing an 
alternative to approximately universal space-time codes with 
a much smaller detection complexity. 

The rest of the paper is organized as follows. The system 
model, notation and problem statement are presented in Sec. 
HTland we also clearly define what we mean by full -rate space- 
time codes. In Sec. [HI] we interpret MIMO as parallel channels 
with a random precoder and explain that this randomness 
requires an extension of the study on signal space diversity for 
parallel channels. We first illustrate this extension by means 
of a toy example in Sec. [IV] where the channel gains, the 
precoder elements and the transmit symbols are all real-valued. 



Next, the extension of the signal space diversity framework is 
formalized in Sec. [V] by introducing the concepts of bad and 
corrupt precoders. This new framework is then used to prove 
that fixed space-only codes do not achieve full diversity (Sec. 
I VIb in contrary to time-varying space-only precoders (Sec. 
I VIA . Theorem [TJ is then corroborated by presenting extensive 
numerical results in Sec. I VIII I 



II. System model 

Notation: we write scalars, vectors and matrices as x, x 
and X. X> and are the Hermitian transposes of X and x. 
The Landau symbols f(n) — 0(g(n)) and f(n) = fl(g(n)) 
respectively denote f(n) < kg(n) and f(n) > kg(n) for some 
positive k. The equation sign f(j) = g(j), introduced in l52l . 
is equivalent to lim^^ '"^^ = lim^oo '"^g^ ■ Similar 
meanings hold for < and >. 



A. Channel model 

We consider a point-to-point MIMO channel H = [hij] G 
£n R xn T w j tn Ut t ransm i]; antennas and receive antennas, 
where hij ~ CAf(Q, 1) is the complex path gain from transmit 
antenna j to receive antenna i. We assume that all path gains 
are independent. The channel state information is perfectly 
known at the receiver side, but unknown at the transmitter 
side, i.e., no feedback channel is available. The channel is 
assumed to vary slowly, so that it remains constant during 
the transmission of at least one outer codeword. A channel 
use is referred to as the event where the transmitter sends 
riT symbols simultaneously from its tit transmit antennas. 
Assuming a total of iV c channel uses per codeword, the 
discrete-time complex baseband equivalent channel equation 
is given by 

y t = V7#x f +w f , t = l,...,N c , (1) 

where y t , w t £ C nR x 1 denote the received vector and noise 
vector at instant t, and x t £ C" T x 1 denotes the symbol vector 
transmitted at instant t. The additive white Gaussian noise 
vector wt has i.i.d. entries, wt.i ~ CAf(0, 1). The transmit 
vector Xt satisfies E[||xt|| 2 ] = 71t,V t. This way, 7 is the 
average signal-to-noise ratio per symbol (SNR) per transmit 
antenna^. The instantaneous mutual information is denoted as 
I(x t ;yt\H) and depends on the channel realization H. 

The channel realization H can be decomposed by a singular 
value decomposition as H = UHV', where U G C" hX " h and 
V G c»itx«t are unitary matrices, uniformly distributed in the 
set of all unitary matrices (see App. [A] for more background), 
and £ G ]R"« xnT is a diagonal matrix with the non-negative 
singular values Cj,i = 1, . . . ,mm(nn, Ut), of H on its 

diagonal. As in 1221 . l52l . we define the ordered normalized 

1 2 

fading gains a = [a u . . . , a min(nRi „ T) ], where a, = — 

OL% > ct2 > • • • > cimin(n R ,n T )- The joint distribution p(a) is 

'in some papers, the average signal-to-noise ratio at each receive antenna 
is considered, which is riT'J- 
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given by 123, (52) 



P (a) = /V- 1 (l0g 7 )" T 7 -(l"H-nT| + l)a I 



i=l 

(n Rl 7i T ) 



7 1 



(2) 



j>i 



Because H is known at the receiver, the following transfor- 
mation can be performed, 

Yt = U% = V7^x t + w tl i=l,...,JV c (3) 

where w t follows the same distribution as w t . Because the 
transformation matrix U< is invertible, no information is lost 
by this transformation (which can be proved by for example 
the data processing inequality ifTOl ) , i.e., J(x t ; y t |E, = 

i(z*;y t \H). 

Let us now consider (O in the cases tir > tit and tir < 
rvr, where S is not square. When tir > tit and £ is tall, the 
last tir — iit received symbols in y t contain only noise due to 
the fact that the bottom tir — tit rows of £ are zero. Hence, 
when we consider tir > iit in the following, then, allowing 
an abuse in notation, y t , w t and £ refer to the top tit rows 
of the actual vectors y t , w t and £, respectively. As a result, 
an equivalent tit x tit MIMO channel is obtained, of course 
taking into account the actual singular value distribution as 
a function of tir and tit (see (f2]i). When tir < tit and E 
is fat, the last tit ~ riR columns of £ are zero, so that the 
last tit — n r columns of V are not important, which will be 
considered in Sec. IVIII 

The outage probability Q, ll3~7l is expressed as 



P ou , = Pr 



Pr(E t [/(x t ;y t |E,y)] < R) 



(4) 
(5) 



where R is the spectral efficiency and E( [.] denotes the 
temporal mean. The SNR-exponent of the outage probability, 
known as the diversity order, is 

log Pout 



lim - 

7— >oo 



log 7 



(6) 



where we assumed fading distributions where the limit in (|6) 
exists. 

B. Full rate space-time and space-only coding 

In this paper, full-rate STCs are considered where we 
define full-rate codes as schemes that transmit, on average, a 
linear transformation of mm(riT,riR) independent Af-QAM 
symbols at each channel use, corresponding to m coded bits 
per channel use, where m = min(nTj tir) log 2 M. Hence, the 



number of channel uses per codeword is N c 



where Nb 



is the length of a codeword of the outer code. 

Note that it is possible to convey tit independent symbols 
per channel use, which is larger than min(nx, tir) when 
riR < tit- However, transmitting Tam.(nr,nii) independent 

2 Note that the dependence created by the outer error-correcting code is 
neglected when we use "independent" in this context. 



symbols maximizes the multiplexing gain (the speed at which 
the constellation size may grow with the SNR 7) that can be 
achieved without leading to a degradation of the error rate for 
increasing SNR l52l . Secondly, the capacity of an tit x Hr 
MIMO channel is the same as the capacity of an tir x tit 
MIMO channel (hence with tir transmit antennas and tit 
receive antennas) [24), ll42l . B71 . so that transmitting more 
symbols on the tit x tir MIMO channel than the maximum 
that can be conveyed on the tir x tit MIMO channel would 
not correspond to the limits of the channel, and thus lead to 
a significant loss in coding gairQ. 

STCs are characterized by the number of space and time 
dimensions, tit and k, respectively, denoted as an tit X k 
STC. When the number of time dimensions k reduces to 
one, the STC becomes a space-only code. An tit x k 
STC (consisting of riTk elements) yields k symbol vec- 
tors {x(/_i) fc+ i, . ■ • , X(i-i)/c+ fc }, often represented through an 
tit x k STC matrix Xi, where Xi :U ,i = xn_x)k+u,ii * = 
1, . . . ,n T ; u = 1, . . .,k;l = 1, . . ., N c /k. 

We first consider the case that ur > tit- The STC Xi is 
obtained from mk coded bits through a sequence of operations. 
The sequence of mk coded bits is split in nyfc groups ofm/riT 
bits which are mapped to one of M points, M = 2 m /™ T , 
belonging to an M-QAM constellation Cl z . Denoting as z/ the 
riT^-dimensional symbol vector that results from mapping the 
l-th block of mk coded bits, the linear precoding involves 



X V ec : i = Pi%u i = 1, . • • , N c /k, 



(7) 



where P; is a well chosen unitary matrix in C nkxnk 
and E[|zjj| 2 ] = l,j — l,...,riTk. Next, the n T k- 
dimensional vector x vec ,; is split in k column vectors of 
length tit, yielding the STC matrix Xi or the k vectors 
{x({_i)fc+i,...,X(i_i)fc+fc}. Because E[|z/,j| 2 ] = 1 and P 
is unitary, the components Xt,i satisfy E[|x4.i| 2 ] = 1,V i, t. 

In the case that tir < tit, full-rate codes transmit, on 
average, a linear transformation of tir independent A/-QAM 
symbols each channel use. Hence, z; only contains riRk 
independent components. For example, (tit — n>R)k compo- 
nents of Zf can be put to zero. To satisfy the constraint that 
E[||x t || 2 ] = riT, V t, we set the mean square magnitude of the 
TiRk non-zero components of z; equal to 221. 

C. Problem formulation 

The approximately universal STCs that are optimal in terms 
of uncoded error rate for an tit x tir MIMO channel are 
tit x tit full-rate full-diversity STCs (hence k = tit)- The 
precoder p = P is constant, has dimension n T x n T and 
its n T elements are optimized to maximize the coding gain 
(see for example g), flS), fl26l, EO), Eg), (|40l). A prohibitive 
objection is that the detection complexity of optimal STCs 
is very complex, e.g., it increases exponentially with ri T for 
exhaustive ML-detection. For example, when 16— QAM and 
tit = 3 transmit antennas are used, the detection complexity is 
0(16 9 ). Although a recent study shows that the complexity of 

3 Consider the simple case of a 1 X 1 channel where transmitting a linear 
combination of two independent symbols each channel use would also lead 
to a large loss in coding gain. 
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near-ML decoding can be reduced to growing in polynomial 
time with the dimension for the best-case scenarios, by using 
lattice reduction and linear preprocessing ||28"1 , it is yet to 
be verified whether the same conclusion holds with error 
correcting codes and soft decoding. 

We propose full-rate time-varying space-only codes, hence 
k = 1 so that Pi only has dimension tit X nr, but Pi 
varies with In Sec. IVHI we prove that full diversity 
is achieved. Because the STC is space-only, the notation 
simplifies: x t = Ptz t , t = 1,...,N C , where z ti £ Cl z , 
i = 1, . . . , min(nT, tir). Hence, z t belongs to f2 z = (f2 z )" T , 
which is the Cartesian product of riy constellations f2 z . 
We denote our new precoder type by the EMI code, where 
EMI (Ergodic Mutual Information) refers to the temporal 
mean of the mutual information. The detection complexity 
of exhaustive ML-detection now increases exponentially with 
min(nT, tir) which is the lowest possible ML detection com- 
plexity for full-rate STCs. For example, when O z = 16— QAM 
and tit = tir = 3, the detection complexity is 0(16 3 ) for 
exhaustive ML-decoding, which is feasible. 

The overall received signal-to-noise ratio per information bit 
is denoted as where Ei, is the ratio of the energy of the 
received symbol vector and the number of information bits 
conveyed per transmitted symbol vector, so that jf- = 7 = 

fq-nfergr^ > where R = mR c is the spectral efficiency and R c 
is the coding rate. When the constraint E[||x t || 2 ] = ny,V t, 
is satisfied, then the SNR is 7 = RE »/ N ° . 

III. MIMO VS. PARALLEL CHANNELS 

The channel equation (01 is that of a parallel channel £ 
(which corresponds to mm(riT,nji) parallel channels), with 
precoded input V'Xt, with the particular feature that the 
precoder is random. We denote the input of this parallel 
channel by 

x t = V^Sc t , (8) 

which belongs to a discrete constellation that is random 
(through V) and variable in time (through the time-varying 
precoder P t ). Allowing a small abuse in notation (by dropping 
the time-index), we denote this constellation by £l x , so that 
x t e £l x . It is clear that x t is a linear transformation of z t , 

x t = yt PtZt = VtZu (9) 

where the precoder V t = V' Pt is random and varies in time 
within the duration of a codeword if P t is time-varying. When 
P t is deterministic or uniformly distributed in the set of all uni- 
tary matrices (denoted as Ai(nx,nx)) and independent from 
V, then Vt is also uniformly distributed in A4(nT,riT) (see 
Lemma[5]in App.lAl. Hence, in the case that a fixed space-only 
precoder P t = P is used, the MIMO channel corresponds to 
a parallel channel with a random precoder V'P, and when 
our proposed time-varying space-only precoder Pt is used, 
the MIMO channel corresponds to a parallel channel with a 
random time-varying precoder Vt. 

For a given channel realization and a given channel use, £7 X 
is fixed and we can determine the mutual information between 
input and output of this parallel channel, I(x t ; yt|£, V) = 



I(5tt;y t \H). Inserting x t in Eq. (01, we have that 

y t = V7Sx t + w t) i=l,...,JV c , (10) 

Note that only the top min(nr, tir) components of the column 
vector x t are important, by the structure of £; hence, the 
complete vector x t is considered when ur > ut, and the 
top ur components of x f otherwise. 

The mutual information /(xt; yt|E, V) between input and 
output of a parallel channel is well known (22), B31 and 
recalled in Eq. (fT2l , where d 2 (c, d) = Y27=i \ Ci ~ ^*l 2 - Using 
the normalized fading gains, the mutual information can be 
expressed as in Eq. ( fT3l l, where 

f(CH,St,i,W t ,i) = e -T 1 - a M 8 t,d 2 -2 V ^'^M< i } ) (11) 

and where Stj = (x ty i — x' t 4 ) and 7Z{.} takes the real part. 

The parallel channel model with precoding is well known 
for the study of signal space diversity (SSD) (see ||6] for 
uncoded and 0131 . ||23l for coded communication over parallel 
channels). In SSD, full transmit diversity is achieved when V 
is chosen so that st.i 7^ 0, i = 1, . . . , mm(riT , tir) ,V t, where 
St = x£ - Xt; x£ = Vtz'.Xt = V t z;z, z' e fi z , z' ^ z. 
Mostly, the fading gain distribution that is considered for 
parallel channels is Rayleigh fading, which yields a maximum 
transmit diversity of min(riT, tir). However, the fading gain 
distribution of the singular values in E is not Rayleigh l33l . 
Il42l . yielding a maximum transmit diversity of nr, which will 
be made more formal in Sec. IVIII 

Consider a parallel channel with a constant deterministic 
precoder, Vt — V co . For any coding rate R c , it holds that full 
transmit diversity is not achieved when V co is a bad precoder. 

Definition 1: We define bad precoders Vb a d as the set of 
precoders so that 3 i E {1, . . . , min(nT, tir)}, z, z' 7^ z, 
satisfying s t s = 0. 

More importantly, if V co is not a bad precoder, full transmit 
diversity is achieved (see (6) for more background). This is 
well known but it is particularly interesting for the following 
reason. 

Consider a constant space-only precoder P t = P, so that 
Vt = V'P = V is random but not time- varying within 
the duration of a codeword. By Lemma in App. [A] the 
distributions of V' and V are the same, hence the distributions 
of H and HP are the same. As a consequence, the space-only 
precoder P achieves the same error rate performance as for 
uncoded MIMO (without precoding, or with P = I), which 
only achieves a diversity order of tir (which is proved in Sec. 
I VP . Contrary to parallel channels, the loss of transmit diversity 
is not caused by bad precoders, which have a zero probability 
of occurrence due to the continuous space of the realizations 
of V. 

Hence, the notion of bad precoders, established in the SSD 
framework for parallel channels with constant deterministic 
precoders, needs to be extended to the case of random pre- 
coders, to explain this loss of transmit diversity (also de- 
noted as spatial diversity). This extension will be exemplified 
through a toy example (in which the constellation Q z and 
the precoder are taken real valued, allowing a geometrical 
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I(x t ;y t |E,y)=m-2-" 1 ]T E yt , Xt 



log 2 ( 2 exp [(d 2 (y t , ^Sx t ) - d 2 (y t , 

x[gO x 



2- m £ E Wf 

x t E!ix 



iog 2 ( 5z n^( ai ' st ' j ' wt ' 1 ) 



(12) 



(13) 




Fig. 1: The system model of the toy example is illustrated. 
The coded bits are first mapped to a point in the constellation 
f2 z (empty circles), which is transformed to a point in the 
constellation Sl x (filled circles). The projections of the points 
of Sl x on the coordinate axes (marked by crosses) illustrates 
the component interleaving. The first and second component 
of x t , xt,i and x t ,2, Vi, are affected by the fading gains j3i 
and fa, respectively. 



illustration) in Sec. [IV] and will be formalized in Sec. IVlfor 
MIMO. 

IV. Toy Example 

For the toy example, we consider a classical system model 
with two flat non-ergodic parallel channels with Rayleigh 
fading and BPSK symbols at the precoder input (see for 
example J6], fl2l ). which allows a geometric illustration. The 
system model is 



Ut.l = y/jftlXt,l + W t ,l 

Vt,2 = y/ifoxta + W t ,2 



t = l. 



,N r . 



(14) 



where /3i and are i.i.d. and Rayleigh distributed, ^[/3f] = 1, 
[xt.i xt,2] T = Q[zt,i z t,2] T where Q is a standard two- 
dimensional rotation matrix parametrized by the angle 9 
and Zt,i G {±1}, and where Wt,i ~ Af(0, 0.5). Because 
each component of x t is transmitted on another fading gain, 
this scheme is also denoted as component interleaving. The 
rotation of z is illustrated in Fig. [TJ According to Def. [TJ 
bad precoders are rotation matrices Q with rotation angle 
@ € #bad = {kir/4, k = 0, . . . , 7}, in which case some crosses 
in Fig. [TJ coincide. 

In the literature, only deterministic rotations were studied. 
Here, we study two other cases: 

• 9 is random, but fixed once it is chosen (Sec. HV-At ; 



• 9 is random and independently generated at each channel 
use (Sec. ITV^Bb . 

The first case is similar to a non-ergodic MIMO channel, 
where the channel is random but remains constant during the 
transmission of an outer codeword. The second case is similar 
to a non-ergodic MIMO channel with a time-varying space- 
only precoder P t at its input. 

A. Random rotation 

It is well known (6), iTOl . ll23l that a diversity order of 
two (denoted as full diversity in this section) is achieved for 
any coding rate R c < 1 (including uncoded communication) 
when a fixed deterministic rotation matrix Q is used and s^i ^ 
0, V t, i, V z' 7^ z, where s t = x t — xj. In the context of MIMO, 
it is more interesting to consider a random rotation matrix Q. 
Despite the fact that Pr(0 £ 6> ba d) = when p{9) = ^, full 
diversity is not achieved as proved by the following lemma. 

Lemma 1: In a point-to-point fiat parallel Rayleigh fading 
channel as given in ( TBI with a fixed but random precoder Q 
where 9 follows a uniform distribution in [0, 2ir], the diversity 
order can not be larger than 1.5 for any coding rate 0.5 < 
Rc < 1. 

Proof: See App. M ■ 
From the proof of Lemma [TJ it is clear that corrupt 
precoders (and not bad precoders) are the main cause of the 
full diversity loss. We formally define corrupt precoders in Sec. 
[VI but for this section, consider corrupt precoders as the set of 
rotations where 9 6 [0,7~ a5 ] 0- The probability to have such 
a corrupt precoder is proportional to 7~°- 5 . When a precoder 
is corrupt, then the mutual information is strictly smaller than 
one in the event of one bad fading gain (see App. [B] for a 
formal definition of bad fading). In this case, a coding rate 
larger than one-half leads to a spectral efficiency that is larger 
than one, yielding an outage event. The probability of a bad 
fading gain, which yields an outage event in conjunction with 
a corrupt precoder, is proportional to 7 -1 . The probability of 
corrupt precoders is unfortunately non-negligible. Lemma [TJ is 
corroborated by means of numerical simulations, presented in 
Fig. [2] When the rotation angle is zero, we have a bad precoder 
and no transmit diversity is achieved. When the rotation angle 
is constant and different from {/c7r/4}, full transmit diversity 
is achieved. However, when the rotation angle is random, full 
transmit diversity is not achieved. 

4 Note that corrupt precoders may also be defined as the set of rotations 
where \0 - fcf | < 7~ ' 5 for k = 1, . . . , 7; but then, for k = 1, 3, 5 and 
7, the lower bound on the coding rate yielding a diversity order of 1.5 is 

log 2 (3) 
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6 — degrees ■ 
8 = uniform in [0, 2it] 
= 27c 




16 19 22 
E b /N [dB] 

Fig. 2: The outage probability of a parallel channel with 
two states does not achieve full diversity when the input 
constellation is rotated with a rotation angle that is random 
(uniform in [0, 2tt]) but fixed once it is chosen. For reference, 
the outage probabilities with a fixed rotation (with = 
and 27 degrees, corresponding to a bad and good rotation) are 
given. Other simulation parameters: Vt z =BPSK and R c = 0.9. 



B. Random time-varying rotation 

The effect of corrupt precoders can be easily reduced by 
using multiple random precoders, say N, during the trans- 
mission of one codeword. We leave a formal description for 
Sec. lVIII but qualitatively describe what happens in the case of 
the toy example. The probability to have one bad fading gain 
behaves as — (the probability of having two bad fading gains 
is not dominant, as it behaves as \ and thus automatically 
leads to full diversity). Given that one fading gain is bad, the 
mutual information at high SNR converges to one and two for 
corrupt and non-corrupt precoders, respectively. The mutual 
information is averaged over these N precoders. We have 
to consider at most one corrupt precoder as the probability 
to have a corrupt precoders behaves as 7~ 0,5 , so that two 
corrupt precoders in conjunction with a bad fading gain also 
automatically yield full diversity. In that case, the mutual infor- 
mation for large SNR, averaged over N precoders, converges 
2(1 — 27v), so that any coding rate smaller 



to 2^- + -i- 

than 1 — 577 yi e lds full diversity 



A similar reasoning is valid for MIMO, which is formalized 
in the next sections. 



V. Bad and Corrupt precoders for MIMO 

The definition of bad precoders was given in Def. Q] Bad 
precoders decrease the maximal mutual information that can 
be achieved. Listing the bad precoders is laborious and is 
fortunately not required for the theoretical analysis of fixed 
and time-varying space-only codes in the remainder of the 
paper. For completeness, we illustrate the bad precoders for 
n, = 4-QAM and a 2 x 2 MIMO channel in App. 

Bad precoders are defined to yield at least one component 
of St with a zero magnitude. The following lemma proves that 
at most ut — 1 components of Sf can have a zero magnitude. 

Lemma 2: At most — 1 elements of s t can have a 
magnitude equal to zero. 



Proof: If ut elements of s t have a zero magnitude, then 
3z,z'/ z, satisfying x t = x.' t , which is impossible, because 
Vt is a bijection (see Lemma [5] in App. lAl. ■ 

As illustrated in the toy example in Sec. IIVI the loss of 
transmit diversity for space-only codes is caused by corrupt 
precoders, which is an uncountable set of precoders that 
behave similarly as bad precoders, in the sense that a number 
of components of s t have a vanishing magnitude with the 
SNR, instead of being zero for bad precoders. We distinguish 
between two sets of precoders, S c .i and S C: 2, which we both 
refer to as corrupt precoders. The first set is relevant in the 
case where tir > tit- 

Definition 2: We define the set S c .\ of corrupt precoders 
as the set of precoders so that 3z,z'/ z, satisfying \s t .i\ < 
7 -°- 5 ,V i > 1. 

For the case that tir < tit, we define corrupt precoders as 
follows. 

Definition 3: We define the set S c ^ of corrupt precoders 
as the set of precoders so that 3z,z'/ z, satisfying \st,i\ < 
7 -°- 5 ,i = l,...,n R . 

Even without determining the structure of these precoders, 
we can derive their probability of occurrence. 

Lemma 3: The probability that a random precoder falls in 
the set of corrupt precoders is 

-(flT-l) 



Pr(S c 



=7 



Pr(S c , 2 )=7~ 



(15) 



for the sets iS Cj i and <S Cj 2, respectively. 

Proof: See App. |Ejj ■ 
In the remainder of the paper, we use the notation S c , which 
refers to S c> i or S C} 2, when ur > tit and tir < tit, 
respectively. As we will see in Sec. I VII the probability Pr(6> c ) 
is non-negligible and causes the loss of transmit diversity. 

VI. Fixed space-only codes 

Consider a unitary but fixed space-only precoder P t = P, 
hence Vt is random (uniformly distributed in the set of all 
unitary matrices M (tit, tit) (Lemma |5]l), but fixed once it is 
chosen. Because Vt remains constant during the transmission 
of an outer codeword, we drop the index t in the vectors x t , y t 
and w t , and denote V = V^P. A new channel H' = ITEV 
is formed, which has the same distribution as H because V 
has the same distribution as V^ . As a consequence, a fixed 
space-only precoder does not achieve transmit diversity. 

In Sec. |nij the relation between MIMO and parallel 
channels was given. More specifically, a MIMO channel is 
equivalent to a parallel channel with random precoding. The 
probability of having a bad precoder (Def. Q3 is zero, but 
corrupt precoders cause the diversity loss, which is formalized 
in the following lemma. 

Lemma 4: In a point-to-point fiat fading ut x tir MIMO 
channel with a fixed tit x tit precoder P t = P, there exists 
a coding rate R c < 1 above which the receive diversity is 
achieved (i.e., the outage SNR exponent is d out = tir) due to 
corrupt precoders V' E S c . 

Proof: See App. |F] ■ 
In accordance with the terminology of parallel channels with 
precoding, the loss of transmit diversity is caused because the 
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random precoder falls too often in the set of corrupt precoders, 
or more precisely, because the probability to have a corrupt 
precoder does not converge fast enough to zero. Note that 5 c ,i 
contains precoders with constraints on tit — 1 components 
of St. A larger set, having constraints on less than riy — 1 
components can be defined as well, but leads to a less tight 
lower bound on the outage probability when using the same 
proof techniques as in App. [F] 

Adopting a time- varying precoder P t , uniform in the set 
of unitary matrices, within the coherence time of the channel 
(thus, assuming that the MIMO channel remains constant) has 
no effect on the diversity order for an uncoded scenario with 
respect to a fixed space-only code P. The reason is that the 
new channel H' = UTiVt still has the same distribution as H, 
with V t = V^P t . 

VII. Time-varying space-only codes, achieving 

FULL RATE AND FULL DIVERSITY 

We propose to reduce the effect of corrupt precoders by 
averaging the mutual information over N unitary precoders 
Vt = V' Pt during the transmission of a codeword. We create 
N realizations Vt by changing P t after every channel uses. 
When N is finite, the time-varying space-only code is denoted 
as EMI-N code. In this section, we assume that N = N c 
and N c — > oo; the corresponding time- varying space-only 
code is denoted as EMI code. The convergence properties of 
the EMI-iV code are discussed in Sec. IVIII-BI By letting P t 
being uniformly distributed in the set of all unitary matrices 
M.{riT,riT), the matrix Vt is also uniformly distributed in 
M(tit,tlt) (Lemma . Furthermore, by drawing P t inde- 
pendently at each channel use, Vt changes independently at 
each channel use. 

Note that the bottom tit — tir elements of z are zero when 
n u < tit, see Sec. IH-BI 

Theorem 1: In a point-to-point fiat fading tit X tir MIMO 
channel, using a coding rate R c < 1 and using tit X tit 
precoders P t , randomly generated for each channel use, be- 
ing uniformly distributed in the set of all unitary matrices 
M(itt,tit), full diversity is achievable. 

Proof: See App. |G] ■ 
Note that in d59l ) in App. [G] we used the fact that the temporal 
mean equals the sample mean, i.e., 

E t [/(x t ;y t |E,F t )] =E[/(x t ;y t |S,y t )], (16) 

where the expectation at the right side in (fT6] l is over Vt, uni- 
formly distributed in the set of unitary matrices M.{tit,tlt), 
for a fixed t. This is valid for the EMI code. 

Theorem [TJ corroborates experimental results in the litera- 
ture. For example, in (271 . (32), the precoder Pt was made 
variable by multiplying an initial precoder Pq by a diagonal 
matrix A t including e? 2 *'^*' on the z-th diagonal element, 
where 6i(t) varies each channel use. The overall precoder 
PoA t is unitary and time-varying^. Because the multiplication 
with A t corresponds to adding a time-varying phase to the 
complex baseband signal at each transmit antenna, the scheme 
was referred to as phase sweeping. Simulation results in (27), 

5 Note however that PoAt is not uniform in the set of unitary matrices. 



(32) suggest that full diversity is achieved in the presence of an 
error-correcting code with a particular coding rate. Theorem [TJ 
now proves that full diversity is actually achievable (through 
the outage probability) for codes with any coding rate smaller 
than one, as long as P t is uniform in the set of unitary matrices. 

VIII. Numerical results 

In this section, we corroborate Theorem [TJ by numerically 
determining the outage probability for the EMI- AT code, which 
are also compared with the outage probabilities of approx- 
imately universal space-time codes. Despite being an unfair 
comparison taking into account the detection complexity, it 
allows to assess the loss of coding gain as a price for the 
reduced detection complexity associated with the EMI code. 
The outage probabilities are achievable lower bounds of the 
word error rate (WER) of practical coded systems. Therefore, 
it is useful to compare the outage probabilities with the WER 
of a coding scheme having the EMI code as inner code and 
an error-correcting code as outer code. 

In Sec. lVHLAl we discuss the optimization of LDPC codes 
as outer code in such a coding scheme. Next, the convergence 
properties of the EMI- A code to the EMI code are discussed 
in Sec. IVIII-BI In the last two subsections, we present the 
numerical results for the rtr x tir MIMO channel with ur > 
tit and tir < tit, respectively. 

The outage probabilities and WERs of the LDPC coded 
modulations are determined by means of Monte Carlo simu- 
lations. If the decoding complexity of exhaustive ML-detection 
of the STCs was too high, then sphere decoding is performed 
ID, G), ED- More specifically, soft output sphere decoding 
is performed 0, so that the Tanner graph of the LDPC code 
gets a soft input from the detectors of the STC. Iterative 
decoding and detection is performed, where after every 10 
LDPC decoding iterations, a new detection of the STC is 
performed. The LDPC code is decoded by means of the sum- 
product algorithm on the Tanner graph. The total number of 
decoding iterations is limited by 100. 

The computation of ( fl2] i in the numerical calculation of the 
outage probability is in some cases (e.g. for approximately 
universal STCs, large constellation size or too many transmit 
antennas) time consuming because the constellation O x might 
be very large. Therefore, the inner sum over x! t 6 f2 x in (TTST i 
is simplified by using a sphere decoder. The sphere decoder 
outputs a list of maximum likelihood metrics d 2 (y t , y^SxJ) 
for all closest constellation points to the constellation point 
with the highest likelihood fT|. The list size is limited to 1000. 

A. LDPC code optimization for the EMI code 

We focus on binary LDPC codes C[Nb, Kbh with dimension 
Kb, so that the coding rate R c = jp. Irregularity is introduced 
through the standard bit and check node degree distributions, 
characterized by the polynomials \(x) and p(x), here from an 
edge perspective [39). 

It is worth citing |43) . Il50l . (5D and references therein, 
where the degree distributions of LDPC codes have been op- 
timized for Gaussian and ergodic MIMO channels. However, 
optimizing these degree distributions for a non-ergodic MIMO 
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channel (hence, where the random channel remains constant 
during the whole codeword) has not yet been performed to 
the best knowledge of the authors. In Sec. [TTlJ we have 
explained that the non-ergodic MIMO channel is equivalent to 
a parallel channel with a random precoder. To the authors' best 
knowledge, only lfT2l presented a tool to optimize the degree 
distributions of an LDPC code for parallel Rayleigh faded 
channels with precoding. We therefore utilized the techniques 
from 1 12 1 to generate the degree distributions of the LDPC 
code. 

We briefly describe the techniques developed in lfT2l . where 
coded modulations (including the precoding matrix, the map- 
ping function and the error-correcting code) were optimized 
to yield a WER closely approaching the outage probability. 
The off-line optimization, done using a geometric approach, 
was limited to at most B + 1 times the effort for Gaussian 
channels, where B is the number of parallel channels, here 
min(nr, tir). The geometric approach involves optimizing the 
coded modulation, e.g. via EXIT charts, in B + 1 well chosen 
points in the B-dimensional fading plane. More specifically, 
in B fading points on the B axes of the fading plane close to 
the outage boundary, plus one fading point on the ergodic line 
(ai = . . . = qb). The outage boundary limits the set of fading 
gain vectors for which the instantaneous mutual information 
is smaller than the spectral efficiency. More details on the 
outage boundary and the optimization of the outage probability 
through a proper selection of the precoding matrix and the 
constellation O z can be found in iTPJl . 

The main difference between the current channel equation 
(see Eq.fTOil and the channel model in |12| is that the precoder 
Vt is random. Hence, the instantaneous mutual information 
7(x f ; y t |S, Vi) does not only depend on the instantaneous 
realization of S, but also on the precoder realization Vt (Fig. 
l3bl shows the variation of the instantaneous mutual informa- 
tion with Vt). Because of the dependence on Vt, an outage 
boundary for the singular values {er,;, i = 1, . . . , min(riR, ut)} 
cannot be defined based on I(x t ; y t |S, Vt). However, when the 
EMI-code is used, the mutual information is averaged over all 
unitary precoders in Ai(riT, n?), so that the mutual informa- 
tion E t [/(x t ; y t |E, 14)] only depends E. We define the corre- 
sponding outage boundary as the set of singular values {(Ji,i = 
l,...,mm(n R ,n T )} which yield E t [7(x t ; y t |S, Vt)) = R. 
In practice, we approximate E f [7(x t ; y t |E, 14)] by averaging 
I(xt; y*|E, Vt) over a large number (e.g. 100) of randomly 
generated matrices Vt- Fig. [3a] illustrates such an outage 
boundary. As a consequence, the same method as in ifLT] can 
be applied to generate LDPC code degree distributions when 
using the EMI inner code. 

B. Maximal coding rate for the EMI-N code 

Theorem [T] proved that full diversity is achieved by the 
EMI code for any coding rate R c < 1. When P t changes 
a finite number of times N during a codeword transmission, 
([T6l > and thus the proof in App.lGl are not valid. The maximum 
coding rate yielding full diversity may be smaller than for the 
EMI code. In this section, we provide an upper bound on the 
maximal coding rate yielding full diversity for the EMI-A 
code, together with supporting numerical results. 



As the probability of occurrence of corrupt precoders is 
strictly larger then 7 _nr , the occurrence of corrupt precoders 
amongst the N precoders observed during the transmission 
of a codeword has to be considered. To derive the maximal 
coding rate yielding full-diversity, the maximal mutual infor- 
mation that can be achieved between input and output, when 
the input is transformed by a corrupt precoder, needs to be 
determined. Then, through a similar formula as Eq. (|63) . the 
maximal coding rate could be determined. 

For example, consider the different classes of bad pre- 
coders for the 2 x 2 MIMO channel and tt z = 4-QAM, 
which are given in Eq. (1381 . Each type of bad precoder 
corresponds to a maximum achievable mutual information 
7(x f ; y t |E, 14), which for the precoders from (l3Ft are given 
by {2, 2, log 2 9, log 2 12, log 2 12}. This can be verified ana- 
lytically or by means of simulation. Next, different subsets 
of the set of corrupt precoders will have a maximal mutual 
information that converges to the maximal mutual information 
of one of the bad precoders for increasing SNR. Each subset 
will have to be defined more precisely than in Defs. [2] and 
[3] i.e., some subsets will have more than one pair of z,z' 
yielding |s f i| < 7~ - 5 for some i. Therefore, some subsets 
might have a smaller probability of occurrence than what is 
given in Lemma [3] 

A detailed study of this behaviour is outside the scope 
of this work. However, we can provide an upper bound on 
the maximal coding rate yielding full diversity, by assuming 
an optimistic case. Instead of averaging I(x t ; yt|E, Vt) over 
Vt = V* Pt by changing P t , we can consider the average 
of 7(x t ;yt|i7) over N realizations of H, corresponding to a 
block fading MIMO channel with N states. The achievable 
diversity order for a block fading MIMO channel with TV 
blocks is upper bounded by l26l 

dup =n R (lNn T (l-Rc)\ + 1). (17) 
Hence, by putting d ap > nr, we get after some calculus that 

-Rc.max < 1 - 4? + 4?- (18) 

N nN 

Further work must verify whether this bound is tight for the 
EMI- AT code. 

The theoretical analysis is laborious, but conjectures can 
be made from the numerical simulations by comparing the 
diversity order of the EMI- AT code with that of the EMI code, 
which achieves full diversity (Fig. HJ. When using N = 2, it 
may be conjectured that the bound in Eq. ( fT8l is achieved as 
full diversity is still achieved for a coding rate R c = 0.75. 
When using N = 10, full diversity is still achieved for a 
coding rate R c = 0.9. However, at a coding rate R c = 0.95, 
which corresponds to the bound in Eq. (fT8l . full diversity is 
not achieved. 

Because it is difficult to design very high coding rate error- 
correcting codes (see B8l and references therein), coding rates 
larger than R c = 0.9 almost never occur in practice. Therefore, 
we only consider the EMI-10 code, achieving full diversity 
at coding rates smaller than or equal to R c = 0.9, in the 
remainder of the paper, unless mentioned otherwise. 
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(a) The outage boundary is shown at two SNR 
values: E b /N = 10 dB and E b /N = 13 dB. 
Only half of the plane is used, as we consider the 
ordered singular values, o\ > 02- 



(b) The mutual information 7(xt; yt|£, Vt) largely varies with Vt while 
(o"i,o"2) is on the outage boundary, especially when o\ 0-2- For a given 
E, the values of 100 mutual informations J(xt; yt|E, Vt) are displayed, each 
corresponding to a realization of Vt. The adopted SNR is Eb/No — 13 dB. 



Fig. 3: At the left side, the outage boundary is shown for 2 x 2 MIMO with Cl z = 4-QAM and R = 3.6 bpcu, which is 
the set of singular values (cri,fT 2 ) where E t [I(x t ; yt|E, Vt)) is equal to R. We approximated E t [/(x t ; y t |S, V t )] by averaging 
7(x t ; y t |S, V t ) over 100 randomly generated matrices V t . The variation of 7(x t ; y t |S, V t ) with V t is illustrated at the right 
side, where (<ti,<72) are located on the outage boundary (the outage boundary is a one-dimensional line, so that only one 
parameter, e.g. <7i, is sufficient to characterize the location on this line). 
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(a) The outage probabilities of the EMI-2 for 7? = {3, 3.2} bpcu 
are shown. It may be conjectured that full diversity is achieved 
for rates not larger than R = 3.0 bpcu, which corresponds to 
R c = 0.75. 
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(b) The outage probabilities of the EMI-10 for R — 
{3.6, 3.7, 3.8} bpcu are shown. It may be conjectured that full 
diversity is achieved for rates R < 3.6 bpcu, which corresponds 
to R c = 0.9. 



Fig. 4: It can be numerically verified whether the EMI-A^ code (EMI-2 at the left side and EMI-10 at the right side) achieves 
full diversity by comparing its outage probability to that of the EMI-code, which always achieves full diversity. Simulation 
parameters: 2x2 MIMO with Q z = 4-QAM and different rates. 



C. Numerical results for the tit x hr MIMO, riR> tit 

When nn > tit, a transformation of tit independent 
symbols are transmitted at each channel use. We compare the 
outage and error rate performance of the EMI-10 code with 
STCs from the literature that are optimal in terms of uncoded 
error rate (e.g., |0], lETI and flS]) or optimal in terms of error 
rate assuming a genie condition (e.g., |8j and |26|). We also 
include the error rate performance without precoding (which 
is equivalent to a fixed space-only code) for reference. 

First, let us consider the 2 x 2 MIMO channel and fl z = 4- 
QAM. Fig. [5] compares the performances of the Golden code 



ID with the EMI-10 code and the case without precoding. 
The performance of other STCs, such as J8], ||26) were also 
tested, but their error rates are slightly larger than that of the 
Golden code, so that they are not displayed for clarity. The 
LDPC code of R c = 0.9 is a regular (3, 30) LDPC code, 
which showed the best performance among other tested LDPC 
codes of the same coding rate. The code length of the LDPC 
code is Nf, = 5760. As theoretically proved in Theorem Q] full 
diversity is achieved by the EMI code. The focus of this paper 
was on full rate, full diversity, and low decoding complexity. 
Of course, the significant reduction in complexity comes at the 
expense of a loss in coding gain (the EMI and EMI-10 code 
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Fig. 5: For a coding rate R c = 0.9 (i.e., R = 3.6 bpcu) and 
a 2 x 2 MIMO channel, the outage probabilities of the EMI 
and EMI- 10 code achieve full diversity and perform at 1.5 and 
2 dB from the outage probability of the Golden code. "Out" 
stands for outage probability, the other curves represent the 
WER of a practical system with an error-correcting code. 
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Fig. 6: For a coding rate R c = 0.75 (i.e., R = 3.0 bpcu), 
the horizontal SNR-gap between the outage probabilities of 
the EMI-10 code and the Golden code reduced from 2 dB 
(for R c = 0.9) to 0.5 dB. Other simulation parameters: 2x2 
MIMO, "Out" stands for outage probability, the other curves 
represent the LDPC code WER. 



respectively require 1.5 and 2 dB more energy to have the 
same error rate performance than the Golden code). However, 
as shown in Figs. [6] and |7J this loss in coding gain vanishes 
when the coding rate decreases. More spefically, the loss in 
coding gain is around 0.5 dB when R c = 0.75 and dB when 
R c = 0.55. In Figs. [6] and [7] the outage probability of the 
EMI and EMI-10 code coincide, so that the EMI code is not 
shown for clarity. Similarly, the error rates of the Golden code 
and the STCs from JD, ll26l practically coincide. The degree 
distributions of the LDPC code were obtained as explained in 
Sec. I VIII-AI and are given by 

AO) = 0.231a: + 0.543a; 2 + 0.226x 19 ; p{x) = x 12 

for R c = 0.75 and by 

\{x) = 0.215x + 0.465a 2 + 0.040a 19 + 0.280X 20 ; p( x ) = a 7 

for R c = 0.55. The code length is N b = 5760. 

Next, we consider the 2x3 and 3x3 MIMO channels using 
Q, z = 4-QAM. Figs. [8] and [9] compare the performances of 
approximately universal STCs (the Golden code JfTj for nx = 
2 and the Perfect STC EH for n T = 3) with the EMI-10 
code and the case without precoding. Other STCs, such as 
0, l26l for tit = 2, have a similar or slightly degraded error 
rate performance, so that they are not displayed for clarity. 
The LDPC code of R c = 0.9 is again a regular (3, 30) LDPC 
code and the code length of the LDPC code is Nb = 5760. 
Full diversity is achieved by the EMI-10 code and as for the 
2x2 MIMO channel, the significant reduction in complexity 
comes at the expense of a loss in coding gain when the coding 
rate is close to one. When R c = 0.9, this loss is approximately 
1.5 dB. 

Finally, we consider a larger constellation size for the 2x2 
MIMO channel, using VL Z = 16-QAM (Fig. [10>. The other 
parameters, such as the degree distributions, coding rate and 
the code length are the same. The performance of the EMI- 
10 code is compared with those of the Golden code, the 
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Fig. 7: For a coding rate R c = 0.55 (i.e., R = 2.2 bpcu), 
the horizontal SNR-gap between the outage probabilities of 
the EMI-10 code and the Golden code has vanished. Other 
simulation parameters: 2x2 MIMO, "Out" stands for outage 
probability, the other curves represent the LDPC code WER. 



cyclotomic code l26l and Aladdin-Pythagoras code [8|. The 
loss in coding gain of the EMI-code is not larger than 1.5 dB. 



D. Numerical results for the tit x tir MIMO, tir < tit 

When njj < tit, a transformation of, on average, 
independent symbols are transmitted at each channel use. For 
the 2x1 MIMO channel, the Alamouti code is well known 
to be optimal in many ways, e.g. in terms of uncoded error 
rate. The loss of coding gain of the EMI- TV code is quite large 
(e.g., more than 3 dB for R c = 0.9) when the coding rate is 
close to one, but decreases with the coding rate (e.g., around 
1.5 dB for R c = 0.75), see Fig.Ql] It may be conjectured that 
the EMI-A^ code can not compete with the Alamouti code for 
the 2x1 MIMO channel, given that the Alamouti code also 
has a low detection complexity. 
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No precoding ■ 
EMI-20 ■ 
Alamouti - 




(a) For R c = 0.9 (i.e., R = 1.8 bpcu), the outage probability of the 
EMI-20 code performs at approximately 4 dB from the Alamouti 
code. 




(b) The horizontal SNR-gap with the Alamouti code reduced to 
approximately 1.5 dB for R = 1.5 bpcu (R c = 0.75). 



Fig. 11: The outage probability of the EMI-iV code is compared with the Alamouti STC for the 2x1 MIMO channel with 
Q z = 4-QAM and rates 1.8 bpcu (left) and 1.5 bpcu (right). Full diversity is achieved but the loss of coding gain is large. 
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Fig. 8: The EMI-10 code also achieves full diversity for 
R c = 0.9 (i.e., R = 3.6 bpcu) and the 2 x 3 MIMO channel, 
performing at approximately 1.5 dB from the Golden code. 
"Out" stands for outage probability, the other curves represent 
the LDPC code WER. 




E b /N„ [dB] 

Fig. 9: Also for the 3 x 3 MIMO channel and R c = 0.9 
(i.e., R = 5.4 bpcu), full-diversity is achieved by the EMI- 
10 code, again performing at approximately 1.5 dB from the 
approximately universal code, denoted as perfect STC. "Out" 
stands for outage probability, the other curves represent the 
LDPC code WER. 



For completeness, we also show the outage probability of 
the EMl-N code for the 3 x 1 MIMO channel (Fig.Qj]), which 
corroborates Theorem Q] i.e., full diversity is achieved, also 
when nji < tit- 

IX. Conclusion 

For unknown channel state information at the transmitter, 
we proposed a new coded modulation paradigm: outer error- 
correction code with space-only but time-varying precoding. 
Full diversity is proved in terms of the outage probability and 
verified by means of numerical simulations. The proofs are 
based on the relation between MIMO and parallel channels. 
For most coding rates, using ten space-only precoders per 
codeword is sufficient and the error rate performance of 
LDPC coded modulations approaches the outage probability. 
The detection complexity of space-only codes is significantly 
smaller than for approximately universal space-time codes, 
coming at a price of coding gain loss for coding rates R c 



close to one (e.g. 1.5 dB for R c — 0.9), which fortunately 
vanishes with decreasing coding rate. 
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Appendix 

A. Random Unitary Matrices 

When a unitary matrix is uniformly distributed in the set 
of all unitary matrices with a particular dimension, then 
formally, it is uniformly distributed in the Stiefel manifold with 
respect to the Haar measure. This section is not a part of our 
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Fig. 10: Also for higher constellation sizes (here Q z 
QAM), the EMI-10 code achieves full diversity in terms of 
WER. Other simulation parameters: 2x2 MIMO, R c = 0.9 
(R = 7.2 bpcu). 
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Fig. 12: We show that the outage probability of the EMI-20 
code achieves full diversity on the 3x1 MIMO channel for 
n z = 4-QAM and R c = 0.9 (i.e., R = 1.8 bpcu). 



contribution, but we briefly summarize the concepts of Stiefel 
manifold and Haar distribution. For a thorough description, 
we refer to ll9l. Il33l. 

A unitary matrix V € £n T xn T belongs to the Stiefel 
manifold A4 nT ,nT> which is the set of all unitary matrices 
in C nTXnT El Sec. III.A] E3 Sec. 2.1.4] H2J Sec. 3.2] M 
Sec. 4.6]. More specifically, M. nT .n T is a subset with n|. real 
dimensions of the inner product space C" tX " t which has 2n 2 
real dimensions. 

In order to consider a probability density function (pdf) for 
V, a means for integration must be defined first, as the integral 
of the pdf over the considered space, here the Stiefel manifold, 
equals one. In the Stiefel manifold, Riemann integration is 
not applicable so that Lebesgue integration, which is a gen- 
eralization of Riemann integration to more general spaces, is 
required j9j Sec. I.D]. A Riemann integral is the sum of the 
volumes of hyperrectangles under the considered function and 
thus partitions the support of that function, while Lebesgue 
integration partitions the function output, so that it can be used 
to integrate functions with a support in more general spaces, 
such as manifolds. However, it requires the definition of a 



measure Sec. I.D], which is a generalized volume element, 
for example to measure the volume of the support where the 
function output belongs to a particular interval. A measure [i, 
defined on a group G, is a Haar measure if 

fi{gE) = fi(E),Vg eG,V£c G, 

where E is a measurable subset of G 1PJ Sec. I.D] and gE = 
{g.e : e £ E}. In this case, consider £ to be a subset of 
■Mn T ,n.T and G to be the unitary group of degree t%t- Consider 
the probability measure^ 

d/i = 



dV 



Vol(M nTt n T )' 

where dV is the differential volume element of M nT „ T 



dV = Vol(M nT ,n T )) and Vol(AL 



is the total 



volume of M. nTtUT ifTTl Sec. 3.2]. It can be shown that this 
measure is a Haar measure ll33] Sec. 2.1.4] |fT7] Sec. 3.2] |[T6] 
Sec. 4.6]. Furthermore, this probability measure defines the pdf 
r, which is the uniform distribution in the 



P( V ) = Vbl(A4„ T ,„ r ) ' 

Stiefel manifold. Hence, the considered Haar measure implies 
a uniform distribution, and there exists exactly one probability 
measure which is Haar. Therefore, matrices that are uniform 
within the Stiefel manifold with respect to the Haar measure 
are also denoted to be Haar-distributed ifTBI Sec. 4.6]. 

Note that Lebesgue integration is often avoided by using 
Jacobians, which actually determine the differential volume 
element dV lfl7l Sec. 3.2]. Similarly to the Stiefel manifold 
M-n T ,n T , one can also consider other Stiefel manifolds Aiij 
for some i and j. 

The following lemma collects some useful properties of 
unitary matrices. 

Lemma 5: Consider the unitary matrices D,U,V £ 
C nrX " T , where U and V are independent and uniformly 
distributed in Ai nT . nT , and D is deterministic. Then 

1) The linear transformation 



C£ : /(b) = Ub = a, 



is a bijection. Next, is uniformly distributed in 
■MhtA f° r an y I-* independent of U. 

2) U' = DU is uniform in AA nT . nT - 

3) V = VU is uniform in M. nT _ nT . 

Proof: 

1) Any non-singular matrix U guarantees that Uhi ^ U\>2 
for bi 7^ \>2, as U^ 1 U\)2 ^ bi, yielding a bijection. 
Consider any c in M. nT ,i- Consider Q so that c = 



Q 



|b|| 



Matrix Q always exists; more specifically, Q 



CA', where C and A are unitary matrices with c 



and 



IflT 



on its first column, respectively. Hence c 



QJ/ypjyy, where QU is uniformly distributed in M. nT ,n T 
(see third bullet). Hence QU can be considered as 
another realization of the random matrix U (hence with 
the same probability density by the uniformity) and thus 
c as the corresponding realization of the random ^yj, 
with the same probability density. Since this holds for 

6 A probability measure is a measure which integrates to one over the 
considered space. 
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any c, all unitary vectors have the same probability 
density, yielding the uniform distribution in Ai nT .i- 

2) For each D, there exists exactly one U yielding U' 
(because D is a bijection), so that p(U') = p(U). 

3) For each U, there exists exactly one V yielding U' 
because U is a bijection. Hence, 

p(U')= I p(V = U'U^\U)p{U)dU (19) 



p{V)p(U)dU 



(20) 



M„ 



p(V) / p(U)dU=p(V), (21) 



which yields the claim. 



B. Proof of Lemma [7J 

The diversity order is upper bounded by the diversity order 



of the outage probability, P out 



p(8) 



Consider Tg = 7 



= Jo* P( e ) P °<*\6 de > Where 

then 



Pout ^ Pea 



> Pn, 



p{6)d8 



p(0)P outl gd6 (22) 



p(6)d6 = P outle=en 



-0.5 



(23) 



- is a sufficient 

7 



where 61 = argmin P out | 9 . Hence, P ou ,| 0= 
ee[o,T e ] 

condition to have that P ou t>^r-x- 

Let us study the mutual information when 6 G [0, Tg]. 
Similarly as in Eq. [T2] we express the mutual information 
between BPSK input and output of the parallel channel in 
Eq. ( 1251 ), where o denotes the component-wise multiplication, 
also known as the Hadamard product or Schur product, and 
j3 = [/3i B 2 \. As was first done in ll52l and then later in ||22] 
App. I] for Rayleigh fading, we define the normalized fading 
so that the mutual information can be 
where 



gains a,i 



log 7 

expressed as in Eq. 



f(pti,st t i,w tt i) = e 7 >s *. 

and where s M = (x tji - x' t i ). 
We recall the distribution of a 



p{a) = (log7) 2 e 7 



(ai,a 2 ) 



7 



(24) 

App. I], 

(27) 



We denote on as an e-bad fading gain when on > 1 + e and 
as an e-good fading gain when on < 1 — e, for e > 0. 

The outline of the remainder of the proof is as follows. 
We first determine a region A in the space of (ai, 012) where 
the mutual information, given a fading point in that region, 
is smaller than the spectral efficiency (the spectral efficiency 
R = mR c equals 2R C in this case) corresponding with a 
coding rate R c > 0.5. The outage probability will then be 
given by the integral f.p(a)da.. As we are only interested in 
the SNR-exponent of the outage probability, we can simplify 
the integral following the same lines as in 11521 Sec. III-B], 



22], by replacing p(a) by q(a) 
q : M 2 -+ -> 



q( a )= 1 - a ^, 



(28) 



because log(7) 2 =l, e -7 1 decreases exponentially with 7 
if ai < and approaches 1 and e for a, > and on = 
respectively, so that we only have to consider a £ R 2 ' + where 
we can drop the exponential term. The set K 2 ' + corresponds 
to all positive real-valued two-dimensional vectors. Hence, 

/ p(a)da= / q(a)da. (29) 

J A JArWL 2 <+ 

As in El . we can apply the dominated convergence theo- 
rem llTTl Theorem 1.6.7] to determine the diversity order (Eq. 
©) 



lim E Wi log 2 ( ^2 JJ/(ai)St,i)iOt,i) 



E w , lirn^loga ^ JJ f(pn, s t ,i, w t>i ) | (30) 

When 9 — s- 0, we have that for each x t and i, 

min|s M | = V2(6» - 0(6» 3 )). (31) 

More specifically, for each x t and i, there exists exactly 
one x' ( 7^ x t so that \s t ,i\ < V2T e when 6 <E [0,Tg] fl 
As a consequence, \st,i\<Tg and it is easy to verify that 
lim 7 _,.oo f(ai, \st,i\<Tg, w t ,i) = 1 if on > 0. The mutual 
information for large 7 can now be written as 

-4 E Wt [log 2 (l + <?(a,x t ))], 



i{^ t -y t \p x ,p 2 ,e) 

where, for all x t , 

g(a,x t ) = 



0(exp[-7 e ]), if ax,a 2 < 1, 

Q (cxp [— 7 _c ]) if ol\ > 1 or a 2 > 1, 

(32) 

where e = |1 — maXjO:i|. Hence, if there exists one e-bad 
fading gain, on > 1 + e, then it can be shown that the 
mutual information is 1 — SI (j~ e ), thus strictly smaller than 
one. Therefore, Pr(J (x t ; y t |a, 9 < Tg) < 2R C ) = 1 when 
R c > 0.5 and a G A e = {a. : Y? i=1 H<Xi > 1 + e} > 1} 
(1{.} is the indicator function). As a consequence, for every 
e > and R r > 0.5, we have that 



> Pr(A)= 



Following the same lines as in 
the SNR exponent d ut|e=e, is 



-4 t nR 2 .+ 

Theorem 4], 



q(a)da. (33) 
" App. I], 



d ut\e=e, < rfout(e) = inf 



2 

E 

j=i 



1 



(34) 



This holds for any e > 0, and the bound in Eq. ( f34b can be 
made tight taking the infinum inf e c? out (e) (see e.g. 1221 . l23l . 



351 ). Combining with 



we obtain d out < 0.5 + d out | e=8l < 



7 Note that it is impossible to have \s t ^\ — > 0, for i = 1,2 (see Lemma 



14 



I(jct ]yt \0x,02,O) = 2-± ]T E y^ 
I{x t] y t \0 1 ,0 2 ,O) = 2-\ £ E 



log 2 £ cxp[(d 2 (y t ,^/3ox t )-d 2 (y t ,^/3ox' t ))] 



iog 2 ( e n^( Q;i ' st ^' u;t ^) 



(25) 



(26) 



1.5. 



C. Sad precoders for Sl z = A-QAM and 2x2 MIMO 

Given Q z , all bad precoders can be identified, for example 
by listing all possible difference vectors z — z' and conse- 
quently determining the structure of the precoder so that 3 i 
where st,i = 0. It can be verified that for Q z = 4— QAM 
(corresponding to m = 4), the bad precoders Vt> a d is the set 
in Eq. (l38l l. where w, ip,(f>,d E [0, 2ir] and where 



6 = 



fc G {0,...,3} 



7T 7T 

-4+^2' 
(fc0-2)|. 



h e {o,...,3} 



(35) 
(36) 
(37) 



For each precoder V t <G V^, the maximum mutual informa- 
tion /(xt; yt|Vf ) can be determined. It can be verified that for 
the first two types of bad precoders in Eq. (l38l l. the maximal 
mutual information is ^ = 2. This laborious study of bad 
precoders is only necessary to analyze the case with the time- 
varying precoder when a small finite number of unitary matri- 
ces is used. Fortunately, the average mutual information when 
N precoders are used rapidly converges to the expectation of 
the mutual information, E t [I(x t ; y t | Vt)], which corresponds 
to N — > oo (see Sec. IVIII-Bb . In Theorem Q] we assume 
that N — >• oo and thus only consider Et[/(xt; y*|Vt)]. The 
convergence properties to this case for finite N are discussed 
in Sec. IVI11-BI 

D. Probability of corrupt precoders 

Before we determine the probability of corrupt precoders, 
we consider the following Lemma. 

Lemma 6: Consider two independent random variables X 
and Y which are up to a constant % 2 -distributed with param- 
eter 2a and 2b, respectively. Then the random variable 

Z= ~P(a,b), 

where j3(a, b) is the Beta-distribution with parameters a and 
b, respectively. 

Proof: X ~ T(a, 6 = 2) X ~ x 2 (2a). A property of 
Beta-distributions J29] Chapt. 25] is that if X ~ T(a,0 = 2) 
and Y ~ T(b, = 2), then Z = ~ j3{a, b). ■ 
Consider the unitary matrix Vt, which is uniformly dis- 
tributed in the set of all unitary matrices A4 (tit, tit) (see App. 
IA1 . Consider any pair z, z' 6 fl z and define \r t = V r^E^rn , 
so that s t = ||z — z'||v{. It follows from Lemma [5] that 
Vt is uniform in A4 nT j. As a consequence of the uniform 



distribution, it is shown in ifTTl Theorem 3.1] that v< can be 
constructed as 



v* 



|g| 



(39) 



where g — C7V(0, 1). Hence, 

. 2^i=2 \9i\ . i |2 



E 

i=2 



H=2 

™ T l/i.|2 



and u t . 



Erjii^i 2 ' 



where g ~ CN{0,I). Because |gj| 2 ~ X 2 ( 2 ) and 
S™=2 Ifil 2 ~ X 2 (2(nT — 1)) (up to a constant), we have 
*at Y.Z2 \vt.A 2 ~ P(nr - U) and \v t A 2 ~ /3(l,n T - 1) 
by Lemma [6] Now, we can determine the probability that 
{|s*,i| 2 < 7~\ * > 1} for this given pair z, z' G fi z , denoted 
by PCP (pairwise corruption probability), as 

PCP = Pr(| S t. 2 | 2 <7~\---,kn T | 2 <7 _1 ) (40) 
The PCP is upper and lower bounded as follows 

Pr(f>M| 2 <7~M <PCP 



<Pr £| Sm | 2 <(«t-1)7^ 



,i=2 



(41) 



Because v t is proportional to Sf, we have 
Pr(X>M| 2 <7 _1 )<PCP 



,i=2 



(42) 



which is 



-(«t-i)> 



(?IT-1) 



7 v" ^<PCP<7 
by App. |E] Hence, 

pcp = 7 -(™ t - 1 ). 

The probability of a corrupt precoder is bounded as 
PCP < Pr(5 C) i) < — -PCP 



yielding 



Pr(5 c ,i) = 7 



-(nT— 1) 



(43) 
(44) 

(45) 
(46) 



The lower bound in d45l > is because the PCP only considers one 
of the - — ^ 2 2 pairs z, z' <E il z . The upper bound neglects 
the correlation between the different pairs z, z' 6 il z . 
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(38) 



The proof for Pr(6> Ci 2) follows the same lines by considering 
the components {s t .i,i = l,...,r} instead of {sf.;,i = 
2,...,n}. 



£. Asymptotic cdf of Beta distribution 

We are interested in Pr(y < x) for small where y ~ 
/3(1, riT — 1) or y ~ (3(riT — 1, 1)- The cdf of a Beta-distributed 
random variable y with parameters a and 6 is 



Fy(x,a,6) = / 
Jo 



* r(a + 6) a; Q - 1 (l-.T)''- 1 d 2; . (47) 



/o r(a)r(t) 

Hence, 

F Y (x, l,n T - 1) = -(1 - x)^- 1 + 1 = (n T - l)x - 0(x 2 ) 

(48) 

F Y {x,n T - 1,1) = / T_1 (49) 
where it is assumed that |x| < 1 in ( |48T >. 

^ Proof of Lemma [4] 

The achievability is shown by simply using one transmit 
antenna and maximum-ratio combining (ML detection) at 
the receiver. For the converse, we lower bound the outage 
probability as follows. 



P 



(50) 



5 ou, = / p(V')P 0Ut \ v ,dV + I P (V')P 0Utlv ,dV 

JS C JSa 

> Pout\V>=V, [ p(V')dV = P out \y=vMSc) (51) 
Js c 

where S c is the complement of S c and V\ = argmin P out \v- 

V'eS c 

First, let us study Pout|V'=Vp which is the outage probability 
given that V = Vi . The outline of the remainder of the proof is 
the same as in the proof of Lemma [T1(App. IB1. We determine 
a region A in the space of a where the mutual information 
(given in Eq. [T3l is smaller than a particular spectral efficiency 
R. The outage probability, given by f A p(ct)da, can then be 
simplified following the same lines as in ll52l Sec. III-B], l22l 
because we are only interested in its SNR-exponent. More 
specifically, it can be verified that 1521 Sec. III-B], 



p(a)da= / q(a)da, (52) 

A J.4nR min< "T."-R>>+ 



where 



. jjgmin(nT,nfi)- + 
min(n^,7iT) 

j(a) = 7 



— (2-i— 1+\ur— ut\)c 



(53) 



In order to determine the diversity order (Eq. d§}), we 
can apply the dominated convergence theorem liTTl Theorem 



1.6.7], |[22] App. 2], so that 

(TIT \ 
X 'ea x »=i / 

lim log 2 J2 Y[f( a h s ^ w i) 



lim E v 

7— >oc 



= E v 



(54) 



Consider first the case that tir > ut- For corrupt precoders, 
3 z c ,z' c 7^ z c , satisfying |s<| < 7~°- 5 ,Vi > 1. In the 
case of a corrupt precoder, we observe that for large 7 and 
i > 1, lim 7 ^oo /(oj, Si,Wi) = 1 if on > 0. For large 7, the 
instantaneous mutual information is 



/ (x; y|£, V G <S c ,i) = m-2" m ]T E w [log 2 (1 + g(a, x))] , 
where 



fl(a>Xc) 



and 



O [ e~T 

n ( e^(^)° 1_1 

Note 



V'z r 



Exvx nr=i /( a i> s *> 



if «i < 1 
, if «i > 1 

that g(a,x) 



As a consequence, in the event that a G .A £ .i, the instanta- 
neous mutual information is m — f2 (exp [— 7 _£ ]), where 



A tt i = {a : a\ > . . . > a nT > : ot\ > 1 + e}, 



(55) 



and e > 0. Hence, there exists a coding rate so that the spectral 
efficiency mR c is always larger than or equal to the mutual 
information, given that a G A e ,\. For this coding rate, the 
outage probability is 



-Pout|v"es c .i ^ 



p(a)da= I q(a)dat. (56) 

A eA JA.,1 



Noting that Pr( l S c ,i)=7~(™ T ~ 1 ) and thus 
f out> 4-' -^out| v eS c 1 i an d following the same lines 
as in lf52l Theorem 4], 11231 . we obtain the SNR-exponent 



<4ut < dout(e) = ™t - 1 



inf 



E 

i=l 



(2t — 1 + Ujj — 7lr)Qj. 

(57) 

The infinum is (njj — + 1)(1 + e), which is achieved when 
a nT = ...=^2 = and ai = 1 + e. This holds for each 
e > 0, and the bound in Eq. ( |57] | can be made tight taking the 
infinum inf e d out (e) (see [23]), hence we obtain rf out < ur. 

Now consider the case that tir < tit- For corrupt precoders, 



3 z c ,z' ^ z c , satisfying |s,| 2 < 7" 



-l-e» 



> 0, V % 



1, . . . , 71r. In the case of a corrupt precoder, we observe that 
for large 7, lim^oo /(ai, s^, Wi) = 1 if a, > 0. For large 7, 
the instantaneous mutual information is 



J(x;y|E,V'e5 c , 2 ) 



E w [log 2 (l + g(a,x))] , 
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where 

g(oL,x c ) = Q (ew) 



, x c = V'z c 



(58) 



Hence, for large 7, the mutual information is therefore 
to - Q ^ e -7- max ' (a ' +tl) ^ for a ii q, e Hence, there 

exists a coding rate so that the spectral efficiency mR c is 
always larger than or equal to the mutual information. For this 
coding rate, the outage probability P ut\v'es c 2 =1- Noting that 
Pr(S c%2 )=7~" R and thus p oat>^Paat\V'es e , 3 > we obtain that 



G. Proof of Theorem Q] 

Similarly to S c , we define a larger set S c ,3, which is the 
set of precoders V t so that 3 i < mm(riT, tir), z, z' 7^ z, 
satisfying \s t i \ 2 < (log'7) _p , for any p > 0. The probability 
Pr(Vt G <S C 3) — > for large 7 (see App. [__). Denoting 

7(x t ;y t |S,\4) as /(V*), we can write E t [I(V t )] as 

Et [J(V t )] = f p(V t )I(V t )dV t + [ p(V t )I(V t )dV t . 

(59) 

> /i„ = I(Vi) f p(V t )dV t = J(V,)(1 - Pr(5 c , 3 )), 

JS C ,3 

where VJ = argmin 7(Vt) (worst case). Thus, we have that 

v t es c , 3 

P out = Pr(E t [I{V t )\ <R)< Pr(/ m < R), (60) 
By definition of 5 Ci 3, \st i\ > (log7)~ p , so that for large 7, 

f 0(e-^^*) , ai <l 
f(ai,s ti ,Wt,i)=< } f i\c.i-i\ (61) 
[ n (e-UJ J , oj > l. 

Hence, if [)™ ( " t '" r) < 1} > 1 (1{.} is the indicator 
function), then I(Vi) — > m for large 7. More specifically, 
I(Vi) — > m for large 7 when a £ _4 f , where 

min(nT ,tir) 

A c = {a: H a i < 1 - e} > 1}, (62) 

i=l 

for e > 0. Note that the fading gains are ordered, so that this 
set is equivalent to 

Ae • Q-l ^ • • • ^ (%min(nT ,n r) ^ ■ ^mir^nT^nj?) ^ 1 

As a consequence, J(Vj|a € „4 C ) = m — O (e r ). Note that 
for large 7 and i? c = 1 — £2, £2 > 0, 

lim Pr(/(V r / |a € A e )(l - Pr(S c , 3 ) < .Rc-to) = 0. (63) 

7— >oo 

In other words, Pr(/; n < R) < Pr(.4 e ). Hence, for large 7 
and any 6,62 > 0, the outage probability is upper bounded 
by Pr(A), where A e = {a : _{ a . > 1 - e } = 



i(nT,nn)} or A e = {a. : a\ > . . . > 



^iiiin(nT ,nn) 



> 



Q : min(7i T ,nf i ) > 1 ~ £ }- Following the same lines as before (or 
see _Z_L 1521), 



rfout > sup inf (2i - 1 + \n R - n T \)a t (64) 



By letting €2 — >• + , we obtain rf out = utub, for any i? c < 1. 

//. Probability of 6> c3 

Similarly as in App. [D] we consider v t = Vt 1 i *~*, 1 1 , uni- 
form in the Stiefel manifold M. nT 1, so that s t = ||z — z'||v t ; 
and we consider the PCP so that 

2 m (2 m 1) 

Pr(5 c , 3 ) < — ^ ^PCP (65) 

where in this case 

PCP = Pr(| SM | 2 < (log 7 )- p ) (66) 

for at least one i £ [1, . . . , min(nT, tlr)]. Hence 

min(riT 

PCP< J2 Pr (M 2 < (log7)" p ) (67) 

i=l 
min(riT 

= ^ Pr(|^| 2 <(log7)- p ) (68) 

i=l 
min(nT 

= ^ (n T -l)(log 7 )-P-0((log 7 )~ 2p ), (69) 

i=l 

where d68l l follows from s t being proportional to v t , where 
j69l ) follows from App. [E] noting that \v t .i\ 2 ~ /3(l,riT — 1) 
(see App. ID). Hence, for large 7, Pr(5 c .3) — > 0. 
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